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Abstract 

Feedback  control  systems  for  aerospace  applications  must  maintain  precise  control  despite 
uncertain  operating  conditions  and  unanticipated  circumstances  such  as  battle  damage.  These 
systems  must  be  designed  to  perform  robustly,  despite  uncertain  design  models  and  difficult  to 
analyze  nonlinear  effects.  They  must  also  be  capable  of  learning  and  adapting  when  accumulat¬ 
ing  data  indicates  that  previous  models  must  be  abandoned  and  that  existing  control  strategies 
must  be  changed.  We  present  recent  developments  that  address  the  need  for  data-driven  design 
methods  well  suited  to  situations  in  which  available  mathematical  models  are  poor  or  unreliable. 
These  innovative  data-driven  design  methods,  collectively  known  as  unfalsified  control  theory, 
facilitate  the  creation  of  robust  control  systems  that  learn,  discover  and  evolve  in  real  time  in 
order  to  rapidly  and  reliably  compensate  for  the  effects  of  battle  damage,  equipment  failures 
and  other  changing  circumstances.  Potential  applications  include  aircraft  stability  augmentation 
systems,  highly  maneuverable  aircraft  design,  missile  guidance  systems,  and  precision  pointing 
and  tracking  systems. 

“It  is  a  capital  mistake  to  theorise  before  one  has  data.  Insensibly  one  begins  to  twist 
facts  to  suit  theories  instead  of  theories  to  suit  facts.  ” 

Arthur  Conan  Doyle,  Sherlock  Holmes 


1  Introduction 

The  robust  multivariable  control  theory  that  has  evolved  over  the  past  quarter  century  includes 
methods  based  on  the  H0 0  fi/Km- synthesis,  and  BMI/LMI/IQC  theories.  The  robust  control  theory 
offers  a  major  improvement  over  earlier  algebraic  and  optimal  control  methods.  It  has  enabled  the 
design  of  controllers  with  greater  tolerance  of  uncertainty  in  system  model  and,  hence,  increased 
reliability.  Commercial  computer-aided  control  synthesis  tools  like  those  introduced  by  Chiang  and 
Safonov  [20,  21],  Balas  et  al.  [22]  and  Gahinet  et  al.  [23]  have  made  robust  control  synthesis  routine, 
and  because  of  this  aerospace  and  industrial  applications  have  now  become  commonplace.  Further, 
on-going  improvements  based  on  LMI/IQC  robust  control  problem  formulations  are  continuing  to 
expand  the  range  of  problems  that  can  be  cast  and  solved  within  the  robust  control  framework 
(e.g.,  [24]). 
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Yet,  despite  the  assurances  of  greater  uncertainty  tolerance  and  better  reliability,  the  existing 
Hoq  fi/Km- synthesis,  and  BMI/LMI/IQC  techniques  for  robust  control  design  have  an  Achilles  heel: 
They  are  introspective  theories.  They  derive  their  conclusions  based  on  assumed  prior  knowledge  of 
models  and  uncertainties.  They  are  dependent  of  the  premise  that  uncertainty  models  are  reliable, 
and  they  offer  little  guidance  in  the  event  that  experimental  data  either  invalidates  prior  knowledge 
of  uncertainty  bounds  or,  perhaps,  provides  evidence  of  previously  unsuspected  patterns  in  the  data. 
That  is,  the  standard  H0 0  p, / ATm-sy  nt  hesis ,  and  BMI/LMI/IQC  robust  control  techniques  fail  in 
the  all  too  common  situation  in  which  prior  knowledge  is  poor  or  unreliable. 

Data-driven  design  tools  are  needed  to  make  the  overall  robust  control  design  process  more 
complete  and  reliable.  Ideally,  these  tools  should  incorporate  mechanisms  for  evaluating  the  design 
implications  of  each  new  experimental  data  point,  and  for  directly  integrating  that  information  into 
the  mathematics  of  the  robust  control  design  process  to  allow  methodical  update  and  re-design  of 
control  strategies  so  as  to  accurately  reflect  the  implications  of  new  or  evolving  experimental  data. 
Recent  thrusts  in  this  direction  are  control-oriented  identification  theory  and  [25,  26,  27,  28,  29, 
30,  31,  32,  33,  34,  35,  36,  37,  38,  39,  40,  41,  42,  43,  44]  and,  more  recently,  unfalsified  control 
[45,  46,  47,  48,  49,  5,  6,  7,  8,  9,  10,  11,  12,  13,  14,  15,  14,  16]  While  both  theories  are  concerned  with 
the  difficult  problem  of  assimilating  real-time  measurement  data  into  the  otherwise  introspective 
process  of  robust  control  design,  the  unfalsified  control  theory  is  a  particular  interest  because  it 
directly  and  precisely  characterizes  the  control  design  implications  of  experimental  data. 


2  Data-Driven  Robust  Control  Design 


Open-loop  DATA 


Need  good 
DATA-DRIVEN 
tools  to  close  the  design 
loop? 


Control  Design  Done 


Figure  1:  The  data-driven  theory  of  unfalsified  control  closes  data-driven  portion  of  the  design  loop 
by  focusing  squarely  and  precisely  on  the  control  design  implications  of  data. 
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Validation  or  more  precisely  unfalsification  —  of  hypotheses  against  physical  data  is  the 
central  aspect  of  the  process  of  scientific  discovery.  This  validation  process  allows  scientists  to 
sift  the  elegant  tautologies  of  pure  mathematics  in  order  to  discover  mathematical  descriptions  of 
nature  that  are  not  only  for  logically  self-consistent,  but  also  consistent  with  physically  observed 
data.  This  data-driven  process  of  validation  is  also  a  key  part  engineering  design.  Successful 
engineering  design  techniques  inevitably  arrive  at  a  point  where  pure  introspective  theory  and 
model-based  analyses  must  be  tested  against  physical  data.  But,  in  control  engineering  in  particular, 
the  validation  process  is  one  that  has  been  much  neglected  by  theoreticians.  Here,  the  theory 
tying  control  designs  to  physical  data  has  for  the  most  part  focused  on  pre-control-design  ‘system 
identification’.  Otherwise,  the  mathematization  of  the  processes  of  post-design  validation  and  re¬ 
design  has  remained  relatively  unexplored  virgin  territory.  In  particular,  a  satisfactory  quantitative 
mathematical  theory  for  direct  feedback  of  experimental  design-validation  data  into  the  control 
design  process  has  been  lacking,  though  this  seems  to  be  changing  with  the  recent  introduction  of 
a  theory  of  unfalsified  control  [49] . 

Theory:  Validation  and  Unfalsification 

Unfalsified  control  is  essentially  a  data-driven  adaptive  control  theory  that  permits  learning  based 
on  physical  data  via  a  process  of  elimination,  much  like  the  candidate  elimination  algorithm  of 
Mitchell  [50,  51].  The  theory  concerns  the  feedback  control  configuration  in  Figure  2.  As  always 
in  control  theory,  the  goal  is  to  determine  a  control  law  K  for  the  plant  P  such  that  the  closed-loop 
system  response,  say  T,  satisfies  given  specifications.  Unfalsified  control  theory  is  concerned  with 
the  case  in  which  the  plant  is  either  unknown  or  is  only  partially  known  and  one  wishes  to  fully 
utilize  information  from  measurements  in  selecting  the  control  law  K.  In  the  theory  of  unfalsified 
control,  learning  takes  place  when  new  information  in  measurement  data  enables  one  to  eliminate 
from  consideration  one  or  more  candidate  controllers. 


Figure  2:  Feedback  control  system. 

The  three  elements  that  define  the  unfalsified  control  problem  are  (1)  plant  measurement  data, 
(2)  a  class  of  candidate  controllers,  and  (3)  a  performance  specification,  say  TspeC)  consisting  of  a 
set  of  admissible  3-tuples  of  signals  (r,  y,  u).  More  precisely,  we  have  the  following. 

Definition  [49]  A  controller  K  is  said  to  be  falsified  by  measurement  information  if  this  infor¬ 
mation  is  sufficient  to  deduce  that  the  performance  specification  (r,  y,u)  €  Tspec  Vr  E  7Z  would 
be  violated  if  that  controller  were  in  the  feedback  loop.  Otherwise ,  the  control  law  K  is  said  to  be 

unfalsified.  □ 

To  put  plant  models,  data  and  controller  models  on  an  equal  footing  with  performance  spec¬ 
ifications,  these  like  Tspec  are  regarded  as  sets  of  3-tuples  of  signals  (r,  y ,  u)  —  that  is,  they  are 
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regarded  as  relations  in  TZ  x  y  x  U.  For  example,  if  P  :  U  — ►  y  and  K  :  1Z  x  y  — >  U  then 


P  = 


K  = 


(r,  y ,  u) 


u  —  K 


And,  if  J(r,y,u )  is  a  given  loss-function  that  we  wish  to  be  non-positive,  then  the  performance 
specification  Tspec  would  be  simply  the  set 


'  spec 


=  {  (r,y,u)  |  J(r,y,  u)  <  0  }  . 


(1) 


On  the  other  hand,  experimental  information  from  a  plant  corresponds  to  partial  knowledge  of  the 
plant  P.  Loosely,  data  may  be  regarded  as  providing  a  sort  of  an  “interpolation  constraint”  on 
the  graph  of  P  —  i.e.,  a  ‘point’  or  set  of  ‘points’  through  which  the  infinite-dimensional  graph  of 
dynamical  operator  P  must  pass. 

Typically,  the  available  measurement  information  will  depend  on  the  current  time,  say  r.  For 
example,  if  we  have  complete  data  on  (u,  y )  from  time  0  up  to  time  r  >  0,  then  the  measurement 
information  is  characterized  by  the  set  [49] 


p.  — 

'  data,  — 


(r,y,u)  ellxU  xy 


Pr 


{u  Udata ) 
( y  ydata) 


(2) 


where  PT  is  the  familiar  time-truncation  operator  of  input-output  stability  theory  (cf.  [52,  53]), 
viz., 

IP  r](f\  A  f  x(t),  if  o  <t<T 

T  i  0,  otherwise. 

The  main  result  of  unfalsified  control  theory  is  the  following  theorem  which  gives  necessary  and 
sufficient  conditions  for  past  open-loop  plant  data  Pdata  to  falsify  the  hypothesis  that  controller  K 
can  satisfy  the  performance  specification  Tspec. 


Unfalsified  Control  Theorem  [49]  A  control  law  K  is  unfalsified  by  measurement  information 
Pdata  if>  and  only  if,  for  each  triple  (ro,yo5^o)  £  Pdata  H  K,  there  exists  at  least  one  pair  (uo,yo) 
such  that 

(^*0)  yO}  Uq)  ^  Pdata  Fl  K  O  ~P spec'  (3) 

Proof:  With  controller  K  in  the  loop,  a  command  signal  ro  G  TZ  could  have  produced  the  measurement 
information  if,  and  only  if,  (ro,ycb^o)  €  Pdata^K  for  some  (uo,yo).  The  controller  K  is  unfalsified 
if  and  only  if  for  each  such  ro  there  is  at  least  one  (possibly  different)  pair  (ui,yi)  which  also  could 
have  produced  the  measurement  information  with  K  in  the  loop  and  which  additionally  satisfies  the 
performance  specification  (ro,yi,^i)  E  Tspec.  That  is,  K  is  unfalsified  if  and  only  if  for  each  such 
ro,  condition  (3)  holds.  □ 


The  Unfalsified  Control  Theorem  constitutes  a  mathematically  precise  statement  of  what 
it  means  for  experimental  data  and  a  performance  specification  to  be  inconsistent  with  a  particular 
controller.  It  has  some  interesting  implications: 

•  The  Unfalsified  Control  Theorem  is  nonconservative;  i.e.,  it  gives  “if  and  only  if”  con¬ 
ditions  on  K.  It  uses  all  the  information  in  the  past  data  —  and  no  more.  It  provides  a 
mathematically  precise  “sieve”  which  rejects  any  controller  which,  based  on  experimental 
evidence,  is  demonstrably  incapable  of  meeting  a  given  performance  specification. 
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•  The  Unfalsified  Control  Theorem  is  “model  free”.  No  plant  model  is  needed  to  test  its 
conditions.  There  are  no  assumptions  about  the  plant. 

•  Information  Pdata  which  invalidates  a  particular  controller  K  need  not  have  been  generated 
with  that  controller  in  the  feedback  loop;  it  may  be  open  loop  data  or  data  generated  by  some 
other  control  law  (which  need  not  even  be  in  K). 

•  When  the  sets  Pdata 5  K  and  Tspec  are  each  expressible  in  terms  of  equations  and/or  inequalities, 
then  falsification  of  a  controller  reduces  to  a  minimax  optimization  problem.  For  some  forms 
of  inequalities  and  equalities  (e.g.,  linear  or  quadratic),  this  optimization  problem  may  be 
solved  analytically,  leading  to  procedures  for  direct  identification  of  controllers  —  as  the 
example  in  [7]. 

Data-Driven  Learning  and  Adaptive  Control 

The  unfalsified  control  theorem  says  simply  that  controller  falsification  can  be  tested  by  computing 
an  intersection  of  certain  sets  of  signals.  A  noteworthy  feature  of  the  unfalsified  control  theory  is 
that  a  controller  need  not  be  in  the  loop  to  be  falsified.  Broad  classes  of  controllers  can  be  falsified 
with  open-loop  plant  data  or  even  data  acquired  while  other  controllers  were  in  the  loop.  Adaptive 
control  is  achieved  within  the  this  framework  by  using  the  unfalsification  process  as  the  key  element 
of  a  supervisory  controller  (cf.  [54,  55]).  The  supervisor  switches  an  unfalsified  controller  into  the 
feedback  loop  whenever  the  current  controller  in  the  loop  is  amongst  those  falsified  by  observed 
plant  data  —  see  Fig.  3. 


r 


LEARNING  FEEDBACK  LOOPS 


evolving  I/O  data 


FALSIFIED 


candidate 

controllers 


given 

iwrrormctnw 
spec's 


^  SIEVE 


22^ 


Unfalsified 

Controllers 


Figure  3:  The  unfalsification  process  mathematically  sifts  controllers  to  find  those  that  are  consis¬ 
tent  with  both  performance  goals  and  physical  data.  It  plays  the  role  of  a  ‘supervisor’  that  chooses 
one  of  the  currently  unfalsified  controllers  to  put  in  the  aircraft’s  control  loop. 

The  step  from  the  Unfalsified  Control  Theorem  to  adaptive  control  is,  conceptually  at  least, 
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a  small  one.  Simply  choosing  as  the  current  control  law  one  that  is  not  falsified  by  the  past  data 
produces  a  control  law  that  is  adaptive  in  the  sense  that  it  learns  in  real  time  and  changes  based 
on  what  it  learns. 

Like  the  controllers  of  [56,  57],  this  approach  to  adaptive  real-time  unfalsified  control  leads  to 
a  sort  of  “switching  control.”  Controllers  which  are  determined  to  be  incapable  of  satisfactory 
performance  are  switched  out  of  the  feedback  loop  and  replaced  by  others  which,  based  on  the 
information  in  past  data,  have  not  yet  been  found  to  be  inconsistent  with  the  performance  speci¬ 
fication.  However,  adaptive  unfalsified  controllers  generally  would  not  be  expected  to  exhibit  the 
poor  transient  response  associated  with  switching  methods  such  as  [56].  The  reason  is  that,  unlike 
the  theory  in  [56] ,  unfalsified  control  theory  efficiently  eliminates  broad  classes  of  controllers  before 
they  are  ever  inserted  in  the  feedback  loop.  The  main  difference  between  unfalsified  control  and 
other  adaptive  methods  is  that  in  unfalsified  control  one  evaluates  candidate  controllers  objectively 
based  on  experimental  data  alone,  without  prejudicial  assumptions  about  the  plant. 

While,  in  principle,  the  unfalsified  control  theory  allows  for  the  set  K  to  include  continuously 
parameterized  sets  of  controllers,  restricting  attention  to  candidate  controller  sets  K  with  only  a 
finite  number  of  elements  can  simplify  computations.  Further  simplifications  result  by  restricting 
attention  to  candidate  controllers  that  are  “causally- left-invertible”  in  the  sense  that,  given  a  K  G  K, 
the  current  value  of  r{t)  is  uniquely  determined  by  past  values  of  u(t),y(t).  When  (2)  holds,  these 
restrictions  on  Tspec  and  K  are  sufficient  to  permit  the  unfalsified  set  to  be  evaluated  in  real-time 
via  the  following  conceptual  algorithm. 


Algorithm  (Recursive  Adaptive  Control) 

Input: 


•  A  finite  set  K  of  m  candidate  dynamical  controllers  Ki(r,y,u)  =  0,  (i  =  1, . . . ,  m)  each  having 
the  causal-left-invertibility  property  that  r(t)  is  uniquely  determined  from  Ki{r,y,u)  =  0  by 
past  values  of  u{t),y{t). 

•  Sampling  interval  At  and  current  time  r  —  nAt; 

•  Plant  data  (u(t),y(t)),  t  E  [0, r\; 

•  Performance  specification  set  Tspec  consisting  of  the  set  of  triples  (r,  y ,  u )  satisfying  the  in¬ 
equalities 

rkAt 

/  fspec(r(t),y(t),u(t),t)  dt  <  0,  Vfc  =  l,...,n. 

Jo 

Initialize: 

set  k  —  0,  set  i  =  m; 

for  i  —  0  :  m,  set  s(i )  —  1,  set  J(i )  —  0,  end. 

Procedure: 

while  i  >  0; 

k  =  k  +  1; 

for  i  —  1  :  m; 

if  s(i)  >  0; 
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for  each  t  E  [(k  —  1)A t,  fcAt]; 
soZz;e  jFQ(r,  ?/,  zz)  —  0  for  r(t); 

(note  that  r(t )  exists  and  is  unique  since  K{  has  the  causal-left-invertibility 
property) 
end ; 

J{i)  —  J(i)  +  J^kAt1)Atfspec(r(t),y(t),u(t),t)  dt\ 
if  J{i)  >  0,  set  s(i )  —  0,  end; 

end; 

end; 

i  =  max  |  z  |  s(i)  >  0  j; 

end. 

This  algorithm  returns  for  each  time  the  least  index  i  for  which  Kr  is  unfalsified  by  the  past 
plant  data.  Real-time  unfalsified  adaptive  control  is  achieved  by  always  taking  as  the  currently 
active  controller 


provided  that  the  data  does  not  falsify  all  candidate  controllers.  In  this  latter  case,  the  algorithm 
terminates  and  returns  i  =  0. 

It  is  important  to  note  that  while  the  above  algorithm  is  geared  towards  the  case  of  an  integral 
inequality  performance  criterion  Tspec  and  a  finite  set  of  causally-left-invertible  Kf  s,  the  underlying 
theory  is,  in  principle,  applicable  to  arbitrary  non-finite  controller  sets  K  and  to  hybrid  systems 
with  both  discrete  and  continuous  time  elements. 

Comment  If  the  plant  is  slowly  time-varying,  then  older  data  ought  to  be  discarded  before  eval¬ 
uating  controller  falsification.  This  may  be  effected  within  the  context  of  our  Algorithm  by  fixing 
r  =  to  and  regarding  t  —  tq  as  the  deviation  from  the  current  time.  The  result  is  a  an  algorithm 
which  only  considers  data  from  moving  time-window  of  fixed  duration  To  time-units  prior  to  the 
current  real-time.  In  this  case  the  unfalsified  controller  set  K ok  no  longer  shrinks  monotonically 
as  it  would  if  r  were  increasing  in  lockstep  with  real-time. 

Conceptual  Issues 

uHeavier-than-air  flying  machines  are  impossible.” 

Lord  Kelvin,  President,  British  Royal  Society,  1895 

A  typical  initial  response  from  a  knowledgeable  academic  researcher  might  be  to  dismiss  unfal¬ 
sified  control  theory  out  of  hand  as  a  sort  of  mathematical  chicanery.  The  claim  that  unfalsified 
control  permits  control  design  without  a  plant  models  has  tended  to  be  regarded  as  too  outlandish 
to  be  taken  seriously.  Certainly  it  is  true  that  unfalsified  control  theory  has  its  limitations  —  and 
that  the  theory  needs  improvement.  But,  such  objections  to  unfalsified  control  are  fallacious  — 
based  on  intuition  derived  from  inappropriate  analogies.  Thoughtful  control  theorists  have  been 
genuinely  surprised  and  impressed  by  the  simplicity  and  power  of  the  unfalsified  control  theory 
as  a  mathematical  basis  for  explaining  feedback  and  learning,  as  a  practical  method  for  designing 
more  reliable  adaptive  controllers,  and  as  a  data-driven  technique  for  off-line  tuning  of  non- adaptive 
feedback  control  gains. 

Following  are  some  typical  examples  of  fallacious  objections  to  unfalsified  control: 
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1.  Unfalsified  control  seems  unacceptably  weak  in  its  conclusion  of  mere  unfalsification,  given 
that  familiar  theories  of  control  seem  to  offer  stronger  predictions  like  global  stability  and  op¬ 
timality  derived  deductively  through  the  analysis  of  models  even  without  the  aid  of  validating 
data. 

However,  this  objection  fails  to  recognize  the  fundamental  distinction  between  conclusions 
deduced  from  model  or  assumptions  and  conclusions  obtained  via  a  mathematical  analysis  of 
experimental  data:  Beliefs  about  the  validity  of  models  and  assumptions,  and  therefore  any 
conclusions  based  in  whole  or  in  part  on  mathematical  models,  are  not  necessarily  scientific 
truths  —  they  might  be  falsified  by  future  physical  data.  Unfalsified  control  augments  intro¬ 
spective  model-based  robust  control  design  methods  by  providing  a  quantitative  methodology 
for  closing  the  loop  on  the  control  design  process  when,  at  the  experimental  validation  stage, 
the  model-based  robust  control  design  proves  to  be  unsatisfactory. 

2.  Unfalsified  control  theory  incorporates  no  sensor  noise  models,  and  therefore  must  perform 
poorly. 

In  unfalsified  control,  control  system  performance  criteria  are  framed  directly  in  terms  of 
observed  variables.  This  runs  counter  to  established  intuition  for  some  control  theorists  who 
have  grown  overly  comfortable  with  traditional  control  problem  formulations  that  characterize 
physical  measurements  as  noise-corrupted  observations  of  the  unseen  internal  ‘reality’  of  a 
‘true’  model.  Given  this  tradition  of  regarding  models  and  noise  beliefs  as  having  more  truth 
content  than  physical  observations,  it  has  been  easy  to  succumb  to  the  temptation  to  assume 
that  the  models  and  their  noise  are  the  ‘true’  explanation  of  observed  physical  data.  The  fact 
is  that  unfalsified  control  does  accommodate  noise  quite  well  when  suitably  ‘soft’  performance 
criteria  are  employed.  And,  moreover,  the  performance  criteria  used  in  unfalsified  control  are 
quite  flexible,  and  may  even  be  associated  with,  and  derived  from,  traditional  stochastic  noise 
hypotheses. 

3.  Unfalsified  adaptive  controllers  are  claimed  to  be  quick  and  sure-footed  in  discovering  good 
control  gains,  even  for  non-minimum-phase  plants.  This  is  too  good  to  be  true,  and  so  must 
be  false. 

This  erroneous  belief  apparently  arises  from  knowledge  that  popular  model  reference  adaptive 
control  schemes  are  relatively  sluggish  and  have  been  proved  to  fail  for  non-minimum-phase 
plants.  But,  unfalsified  control  is  not  model  reference  control  and  does  not  suffer  its  limits. 
It  is  fast  and  sure-footed  because,  unlike  other  adaptive  schemes  such  as  model  reference  con¬ 
trol,  unfalsified  control  theory  is  based  on  a  precise  analysis  of  the  mathematical  constraints 
induced  by  (1)  performance  criteria,  (2)  physical  data  and  (3)  the  control  law. 

4.  Unfalsified  adaptive  controllers  make  use  of  the  inverse  of  the  controller  transfer  function,  so 
they  must  be  sensitive  to  plant  model  error  and  noise. 

Apparently  some  control  theorists  are  confused  by  the  thought  of  controller  inversion,  since 
it  triggers  unrelated  memories  concerning  known  difficulties  with  controllers  that  rely  on 
inverting  the  plant  itself.  In  any  case,  the  controller  inversion  is  not  an  essential  part  of 
theory  but  merely  one  of  several  conceivable  ways  to  perform  the  computations  that  are 
necessarily  associated  with  any  logically  correct  test  of  consistency  a  controller  hypothesis 
against  performance  goals  and  physical  data. 

5.  Anyone  who  claims  to  be  able  to  design  a  controller  without  a  mathematical  model  of  the  plant 
must  be  a  charlatan,  ergo  unfalsified  control  must  be  the  product  of  charlatans.  A  seasoned 
reviewer  of  our  paper  [49]  put  this  objection  very  eloquently,  saying 
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“Modern  science  is  model-based.  If  to  abandon  models  is  not  to  abandon  mathe¬ 
matical  science. . .  ” 

In  such  arguments,  one  may  perhaps  glimpse  elements  of  the  conflict  between  the  introspec¬ 
tive  belief-driven  methods  of  ancient  Platonic  science  and  the  observation-driven  methods 
of  post-Galileo  experimental  science.  Such  fallacious  views  are  based  on  the  knowledge  that 
established  control  theories  (like  the  theories  of  Plato)  are  heavily  introspective,  relying  on 
models  and  assumptions  -  much  like  modern  control  theorists  must  rely  on  augmenting  evi- 
dentce  with  models  and  assumptions  in  order  to  deduce  such  things  as  stability  or  optimality. 
Platonism  notwithstanding,  the  unfalsified  control  method  makes  it  possible  to  devise  de¬ 
pendable  data-driven  methods  for  discovering  good  control  designs  without  models  via  care¬ 
ful  mathematical  analysis  of  the  logical  implications  of  experimental  observation  alone.  They 
also  fail  to  notice  that  models  play  a  key  role  in  the  unfalsified  control  method,  but  that  the 
models  used  in  unfalsified  control  are  models  of  candidate  controller  hypotheses,  not  plants. 
Indeed,  a  careful  comparison  of  unfalsified  control  theory  and  system  identification  theory 
shows  that  they  are  conceptually  the  same,  except  that  in  unfalsified  control  one  identifies 
controller  models  (not  plant  models)  and  performance  criteria  involve  closed-loop  control 
errors  (not  open-loop  plant  model  errors). 

The  foregoing  are  representative  of  fallacious  criticisms  that  have  been  levied  by  experienced  nay 
sayers  who  were  convinced  that  unfalsified  control  is  a  heavier-than-air  theory  that  could  not 
possibly  fly.  They  were  wrong.  Unfalsified  control  theory  is  taking  off.  It  is  proving  to  be  a 
legitimate  and  useful  vehicle  for  data-driven  control  design,  even  if  the  initial  flights  have  been 
somewhat  ungainly  and  short. 

Design  Studies 

The  1895  declaration  of  the  Lord  Kelvin  not  withstanding,  the  Wright  brothers  flew  in  1903.  As 
for  unfalsified  control  theory,  accumulating  case  study  evidence  is  likewise  proving  the  intuition  of 
experienced  nay  sayers  to  be  wrong.  Over  the  past  three  years  several  design  studies  have  confirmed 
the  theoretical  expectation  that  unfalsified  control  can  be  useful  in  closing  the  outer  data-driven 
loop  on  the  control  design  process.  Unfalsified  control  theory  has  proved  effective  in  applications 
involving  both  off-line  controller  gain  tuning  and  in  real-time  adaptive  control  design  studies.  These 
initial  design  studies  have  helped  us  to  better  understand  the  potential  of  the  unfalsified  control 
theory,  as  well  as  limitations  of  the  current  theory. 

Missile  Autopilot 

One  design  study  that  we  conducted  involved  using  an  unfalsified  controller  to  robustly  discover  PID 
controller  gains  for  an  adaptive  missile  autopilot  ‘on  the  fly’  in  real-time  [9].  Figure  4  summarizes 
the  results  of  the  missile  design.  In  all  trials,  the  response  of  the  adaptive  loops  was  swift  and  sure¬ 
footed  —  in  stark  contrast  to  what  would  be  expected  from  traditional  model  reference  adaptive 
methods. 

Robot  Manipulator  Arm 

We  used  to  unfalsified  methodology  to  adaptively  tune  the  parameters  of  a  nonlinear  ‘computed- 
torque’  controller  for  a  robot  manipulator  arm  [58,  5]  —  see  Fig  fig:robot.  The  arm  proved  to  be 
capable  of  a  quick  and  reliable  control  response  despite  large  and  sudden  variations  in  load  mass. 
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•  Leams  control  gains 

•  Adapts  quickly  to 
compensate  for 
damage  &  failures 

•  Superior  performance 


BiugaroLis,  Fromion  &  S  aforcvv,  ACC  ’93 

Unfalsified  adaptive  missile  autopilot: 


•  discovers  stabilizing  control  gains 
as  it  flies,  nearly  instantaneously 

•  maintains  precise  sure-footed  control 


Figure  4:  A  data-driven  unfalsified  missile  controller  would  have  abilities  to  adaptively  discover 
solutions  in  real-time  to  compensate  for  sudden  in-flight  changes  and  damage. 


Again,  the  controller  performed  with  precision,  despite  noise,  dynamical  actuator  uncertainties 
and  without  prior  knowledge  of  the  plant  model  or  its  parameters.  Results  for  the  robot  design 
were  surefooted  and  precise,  with  the  controller  maintaining  an  order  of  magnitude  more  precise 
control  than  a  similar  model-reference  adaptive  controller  during  widely  fluctuating  manipulator 
load  variations;  the  controller  was  also  more  robust  in  that  it  was  capable  of  maintaining  precise 
control  even  during  load  variations  that  destabilized  a  similarly  structured  model-reference  adaptive 
controller. 


Industrial  Process  Control 

Although  very  few  researchers  other  than  ourselves  have  as  yet  examined  unfalsified  control  meth¬ 
ods,  those  who  have  taken  this  step  have  predictably  confirmed  the  effectiveness  of  unfalsified 
control  methods  in  several  industrial  process  control  applications.  For  example,  Kosut  [41]  exam¬ 
ined  unfalsified  controller  for  direct  data-driven  off-line  control  gain  tuning  under  the  assumption 
of  a  noise- free  linear-time-invariant  plant.  Woodley,  How  and  Kosut  [59]  and  used  the  theory  with 
good  result  for  data-driven  discovery  of  good  control  gains  for  a  laboratory  control  problem  in¬ 
volving  two  spring-connected  masses.  Also,  Collins  and  Fan  [42]  successfully  used  the  unfalsified 
control  methodology  in  a  run-to-run  setting  to  tune  gains  off-line  in  an  industrial  weigh-belt  feeder 
control  design  study.  More  recently,  there  have  been  some  promising  adaptive  control  applications 
to  machine  control  by  Razavi  and  Kurfess  [43,  60]  based  on  the  unfalsified  control  methodology. 
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Figure  5:  Unfalsified  control  produced  superior  results  for  a  nonlinear  two-link  robot  manipulator 
subject  to  uncertain  dynamics,  noisy  disturbances  and  abrupt  changes  in  load  mass.  The  two  slug¬ 
gishly  smooth  traces  large  amplitude  signals  in  the  plot  are  with  a  conventional  adaptive  controller 
used  to  adjust  control  gain- vector  6{t ),  and  the  two  very  low  amplitude  traces  are  for  the  unfalsified 
controller.  The  unfalsified  controller  had  a  much  quicker,  sure-footed  and  precise  response  without 
increased  control  effort. 


Universal  PID  Controller 

One  application  of  the  theory  involved  implementing  a  PID-based  adaptive  ‘universal’  controller 
implemented  as  Matlab  Simulink  block  based  on  the  unfalsified  theory  [10]  —  see  Fig  6.  The 
controller  was  capable  of  sifting  through  a  bank  of  candidate  controllers  in  real-time,  stabilizing  an 
open-loop  unstable  plant  without  knowledge  of  the  plant  model  despite  sensor  noise  and  without 
noticeable  transients. 

3  Discussion 

In  our  unfalsified  control  theory,  decisions  to  adapt  are  data-driven.  Determination  of  which  can¬ 
didate  control  laws  are  suitable  are  made  based  on  experimental  evidence,  i.e.,  the  actual  values  of 
sensor  output  signals  and  actuator  input  signals.  In  this  process  the  role,  if  any,  of  plant  models 
and  of  probabilistic  hypotheses  about  stochastic  noise  and  random  initial  conditions  is  entirely  an 
a  priori  role:  These  provide  concepts  which  are  useful  in  selecting  the  class  K  of  candidate  controllers 
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candidate  controJler 


Figure  6:  In  one  study,  we  designed  an  adaptive  ‘universal  controller5  having  a  PID  structure 
and  based  on  the  unfalsified  control  theory.  Simulations  using  Matlab  Simulink  showed  that  the 
adaptive  unfalsification  loops  were  so  fast  that  the  controller  was  able  to  stabilize  an  unstable  plant 
without  prior  plant  knowledge  and  without  appreciable  transients. 


and  in  selecting  achievable  goals  (i.e.,  selecting  Tspec ).  The  methods  of  traditional  model-based  con¬ 
trol  theories  (root  locus,  stochastic  optimal  control,  Bode-Nyquist  theory,  robust  control  and 
so  forth)  provide  mechanizations  of  this  prior  selection  and  narrowing  process.  Unfalsified  control 
takes  over  where  traditional  model-based  methods  leave  off,  providing  a  mathematical  framework 
for  determining  the  proper  consequences  of  experimental  observations  on  the  choice  of  control  law. 
In  effect,  the  theory  gives  one  a  model-free  mathematical  “sieve”  for  candidate  controllers,  enabling 
us  (i)  to  precisely  identify  what  of  relevance  to  attaining  the  specification  Tspec  can  be  discovered 
from  experimental  data  alone  and  (ii)  to  clearly  distinguish  the  implications  of  experimental  data 
from  those  of  assumptions  and  other  prior  information. 

The  Unfalsified  Control  Theorem  explains  the  learning  mechanisms  of  adaptive  control 
theory.  It  provides  an  exact  characterization  of  what  can,  and  what  cannot,  be  learned  from 
experimental  data  about  the  ability  of  a  given  class  of  controllers  to  meet  a  given  performance 
specification.  A  salient  feature  of  the  theory  is  that  the  data  used  to  falsify  a  class  of  control 
laws  may  be  either  open-loop  data  or  data  obtained  with  other  controllers  in  the  feedback  loop. 
Consequently,  large  classes  of  candidate  controllers  are  falsified  by  even  a  few  experimental  samples 
of  plant  input  output  data.  Candidate  controllers  need  not  be  actually  inserted  in  the  feedback 
loop  to  be  falsified.  This  is  important  because  it  means  that  adaptive  unfalsified  controllers  will 
be  significantly  less  susceptible  to  poor  transient  response  than  adaptive  learning  algorithms  which 
require  inserting  controllers  in  the  loop  one-at-a-time  to  determine  if  they  are  unsuitable. 

A  noteworthy  feature  of  the  unfalsified  control  theory  is  its  flexibility  and  simplicity  of  imple¬ 
mentation.  Controller  falsification  typically  involves  only  real-time  integration  of  algebraic  functions 
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of  the  observed  data,  with  one  set  of  integrators  for  each  candidate  controller.  The  theory  may  be 
readily  applied  to  nonlinear  time-varying  plants,  as  well  as  to  linear  time-invariant  ones. 


4  Conclusion 

As  robust  control  theory  has  matured,  a  key  challenge  has  been  the  need  for  a  more  flexible  theory 
that  provides  a  unified  basis  for  representing  and  exploiting  evolving  information  flows  from  models, 
noisy  data,  and  more.  The  contribution  of  unfalsified  control  theory  has  been  to  close  the  loop  on 
the  adaptive  and  robust  control  design  processes  by  developing  data-driven  methods  to  complement 
traditional  model-based  methods  for  the  design  of  robust  control  systems.  The  theory  strengthens 
the  foundations  of  control  theory,  paving  the  way  for  carefully  reasoned  extensions  of  feedback 
control  theory  into  the  realm  of  intelligent  and  learning  systems.  Unfalsified  control  theory  lays  the 
foundation  for  a  more  complete  and  rigorous  understanding  of  feedback  that  focusing  squarely  on 
the  quantitative  design  implications  of  physical  data,  thereby  enabling  the  design  of  robust  control 
systems  with  the  ability  to  autonomously  enhance  both  their  robustness  and  their  performance  by 
exploiting  real-time  data  as  it  unfolds.  Such  designs  will  be  better  able  to  compensate  for  uncertain 
and  time-varying  effects,  battle  damage,  equipment  failures  and  other  changing  circumstances. 
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